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The theme of this paper is the derivation of analytic formulae for 
certain large combinatorial structures. The formulae are obtained via 
fluid limits of pure jump-type Markov processes, established under 
simple conditions on the Laplace transforms of their Levy kernels. 
Furthermore, a related Gaussian approximation allows us to describe 
the randomness which may persist in the limit when certain param¬ 
eters take critical values. Our method is quite general, but is ap¬ 
plied here to vertex identifiability in random hypergraphs. A vertex 
v is identifiable in n steps if there is a hyperedge containing v all 
of whose other vertices are identifiable in fewer steps. We say that 
a hyperedge is identifiable if every one of its vertices is identifiable. 
Our analytic formulae describe the asymptotics of the number of 
identifiable vertices and the number of identifiable hyperedges for a 
Poisson (/?) random hypergraph A on a set V of N vertices, in the 
limit as N —> oo. Here /3 is a formal power series with nonnegative 
coefficients /3o, /?i, - - -, and (A(A))acv are independent Poisson ran¬ 
dom variables such that A(A), the number of hyperedges on A, has 
mean whenever |A| = j. 


1. Introduction. 

1.1. Motivation. We are interested in the evolution of certain statisti¬ 
cally symmetric random structures, extended over a large finite set of points, 
when points are progressively removed in a way which depends on the struc¬ 
ture. The initial condition of the structure may allow few possibilities for 
the removal of points; indeed, it may be that, once a small proportion of 
points are removed, the process terminates. On the other hand, the removal 
of points may cause the structure to ripen, eventually yielding a large pro¬ 
portion of the initial points. Our analysis will enable us to demonstrate a 
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sharp transition between these two sorts of behavior as certain parameters 
pass through critical values. 

Let us illustrate this phenomenon by a simple special case. Consider the 
complete graph on N vertices and declare each vertex to be open with 
probability p, each edge to be open with probability a/N. Suppose that we 
are allowed to select an open vertex, remove it, and declare open any other 
vertices sharing an open edge with the selected vertex. If we continue in 
this way until no open vertices remain, we eventually remove every vertex 
connected to an open vertex by open edges. We shall see that the proportion 
of vertices thus removed converges in probability as IV —> oo and that the 
limit z*(p,a) is the unique root in [0,1) of the equation 

az + log(l — z) = log(l — p ). 

Thus, for small values of p, there is a dramatic change in behavior as a 
passes through 1. As p | 0, for a < 1, 

z*ip,oc)/p^ 1/(1 -a), 


but for a > 1, 


z*( 0+, a) > 0. 

Of course, this is a reflection of well-known connectivity properties of random 
graphs, discovered by Erdos and Renyi [7], and discussed, for example, in 
[3], 

The class of models considered in this paper is a natural generalization 
of some classical models of random graphs and hypergraphs, which may 
be further motivated as follows. Phase transitions in combinatorial prob¬ 
lems constitute an area of active research among computer scientists. Many 
“hard” combinatorial problems can be cast as satisfiability problems, which 
seek to assign a truth value to each of a set of Boolean variables, such that a 
collection of logical conjunctions are simultaneously satisfied. Phase transi¬ 
tions for random satisfiability (“random k-SAT”) problems have been stud¬ 
ied by researchers at Microsoft [1, 2, 15] and IBM [4], but difficult questions 
remain unanswered. The random hypergraph model herein may be viewed 
as a simplification of the random satisfiability model: a vertex corresponds 
to a Boolean variable, and a hyperedge to the set of variables appearing in 
a specific logical conjunction, neglecting the truth or falsehood assigned to 
those variables. Under this simplification, definitive critical parameters are 
obtained which shed light on the random satisfiability model, and whose 
derivation may serve as a template for analysis of mixed satisfiability prob¬ 
lems. 
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1.2. Hypergraphs. Let V be a finite set of N vertices. By a hypergraph 
on V we mean any map 

A:V{V)-*1 + . 

Here Z + denotes the set of nonnegative integers. The reader may consult 
[6] for an overview of the theory of hypergraphs; however, the direction pur¬ 
sued here is largely independent of previous work. We emphasize that, in 
distinction to much of the combinatorial literature on hypergraphs, we al¬ 
low the possibility that more than one edge is assigned to a given subset; 
thus we are considering multi-hypergraphs. Moreover, we do not insist that 
all hyperedges have the same number of vertices. Much of the literature is 
restricted to this uniform case. Our methods allow a significant broadening 
of the class of models for which asymptotic computations are feasible. Hy¬ 
peredges over vertices are called patches (loops in [6]) and hyperedges over 
0 are called debris. The total number of hyperedges is 

|A| = £a(a). 

A 

1.3. Accessibility and identifiability. Interest in large random graphs has 
often focused on the sizes of their connected components. If there is given 
also, as in the example above, a set of distinguished vertices Vo, then it is 
natural to seek to determine the proportion of all vertices connected to Vo- 

In the more general context of hypergraphs there is more than one inter¬ 
esting counterpart of connectivity. Given a hypergraph A on a set V, we say 
that a vertex v is accessible in 1 step or, equivalently, identifiable in 1 step 
if A({u}) > 1. We say, for n = 2,3,..., that a vertex is accessible in n steps 
if it belongs to some subset A with A(A) > 1, some other element of which 
is accessible in less than n steps. A vertex is accessible if it is accessible in 
n steps for some n > 1. 

On the other hand, we say that a vertex is identifiable in n steps if it 
belongs to some subset A with A(A) >1, all of whose other elements are 
identifiable in less than n steps. A vertex is identifiable if it is identifiable in 
n steps for some n > 1. 

The notion of accessibility may be appropriate to some physical models 
similar to percolation, whereas identifiability is more relevant to knowledge- 
based structures. We shall examine only the notion of identifiability. 

Given a hypergraph A without patches and a distinguished vertex vq, we 
say that a vertex v is accessible from vq if it is accessible in the hypergraph 
A + that is, in the hypergraph obtained from A by adding a single 

patch at vq. Identifiability from vq is defined similarly. The set of vertices 
accessible from vq is the component of vq, as studied in [6, 11, 12, 14]. The 
set of vertices identifiable from vq is the domain of vq, as studied by Levin 
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Fig. 1. Example of a permitted collapse—deletion of one vertex. 

and the current authors [5]. We shall not consider further in this paper these 
vertex-based notions. 

The process of identification is dual to the process leading to the 2-core of 
a graph or hypergraph, that is to say, the maximal subgraph in which every 
nonisolated vertex has degree at least 2. In the former process one removes 
vertices having a 1-hyperedge, in the latter one removes edges containing a 
vertex of degree 1. In this duality, nonidentifiable vertices correspond to the 
2-core. Thus our results may be interpreted as giving the asymptotic size of 
the 2-core for a certain class of random hypergraphs. 

1.4. Hypergraph collapse. It will be helpful to think of the identification 
of vertices as a progressive activity. Once a vertex is identified, it is removed 
or deleted from the vertex set, in a manner which is explained below. Thus, 
we shall consider an evolution of hypergraphs by the removal of vertices 
over which there is a patch. A hypergraph with no patches will therefore 
be stable. Given a hypergraph A and a vertex v, we can arrive at a new 
hypergraph A' by removing v from each of the hyperedges of A. Thus 

A'Ml = ( + A (^ U M)> if V( £A, 

K ’ \0, if v€A. 

For example, in Figure 1, the patch on the central vertex is selected, and 
that vertex is removed; this causes a triangular face to collapse to an edge, 
and two edges incident to the vertex to collapse to patches on the vertices 
at the other ends. Note that this leaves two patches on the lower left vertex. 

If A({u}) > 1, then we say that A' is obtained from A by a ( permitted ) 
collapse. Starting from A, we can obtain, by a finite sequence of collapses, a 
stable hypergraph A^. Denote by V* the set of vertices removed in passing 
from A to A^. The elements of V* are the identifiable vertices. We write A* 
for the identifiable hypergraph , given by 

A* (A) = A(A)±a<zv* ■ 
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We note that V*, and hence A* and Aoo, do not depend on the particular 
sequence of collapses chosen. For, if v\,V 2 ,... and v[,v' 2 ,... are two such 
sequences, and if v n ^ v k for all k, then we can take n minimal and find 
k such that {sq,..., u n _i} C {i^,..., v' k }; then, with an obvious notation, 
Jk' k ({ v n}) > A n _i({u n }) > 1, so v n must, after all, appear in the terminating 
sequence v[,v 2 ,..., a contradiction. We note also that V* increases with A. 

1.5. Purpose of this paper. The main question we shall address is to 
determine the asymptotic sizes of V* and A* for certain generic random 
hypergraphs, as the number of vertices becomes large. We note that, since 
the number of hyperedges is conserved in each collapse, all the identifiable 
hyperedges eventually turn to debris: 

Aoo(0) = | A* |. 

Note that V* depends only on min{A,l}. In the case where A(A) = 0 
for |A| >3, the hypergraph min{A, 1} may be considered as a graph on V 
equipped with a number of distinguished vertices. Then V* is precisely the 
set of vertices connected in the graph to one of these distinguished vertices. 

1.6. Poisson random hypergraphs. Let P) be a probability space. 

A random hypergraph on V is a measurable map 

A:Hx V(V) —► Z + . 

An introduction to random hypergraphs may be found in [11], though we 
shall pursue rather different questions here. We shall consider a class of 
random hypergraphs whose distribution is determined by a sequence (3 = 
(Pj : j E Z + ) of nonnegative parameters. Say that a random hypergraph A 
on V is Poisson(/3) if: 

1. the random variables A (A), A C V, are independent; 

2. the distribution of A(A) depends only on |A|; 

3 - E|A|=;A(A) ~ Poisson(A r (3j), j = 0,1,... ,N. 

A consequence of these assumptions is that A(A) has mean Nfjj/ ('J) when¬ 
ever |A| = j. Note that, when N is large, for j > 2, only a small fraction of 
the subsets of size j have any hyperedges, and those that do usually have 
just one. Also the ratio of j-edges to vertices tends to f3j. Our assumption 
of Poisson distributions is a convenient exact framework reflecting behavior 
which holds asymptotically as N —» oo under more generic conditions. 

1.7. Generating function. A key role is played by the power series 

P(t)=j2(3jt j 

j> 0 


(1) 


6 


R. W. R. DARLING AND J. R. NORRIS 


and by the derived series 

P'(t) = Y,jW-\ 

j> i 

P"(t) = '52 jU- 

3> 2 

Let /? have radius of convergence i?. The function / 3 / (t) + log(l — t) may have 
zeros in [0,1) but these can accumulate only at 1. Set 

(2) z* = inf{t G [0,1) : ft (t) + log(l — t) < 0} A 1 

and denote by ( the set of zeros of (3'{t) + log(l — t) in [0, z*). Note that 
if /3 is a polynomial, or indeed if R >1, then z* < 1. Also, the generic and 
simplest case is where £ is empty. 

2. Results. We state our principal result first in the generic case. 

2.1. Hypergraph collapse—generic case. 

Theorem 2.1. Assume that z* < 1 and ( = 0. For A r G N, let V N be 
a set of N vertices and let A N be a Poisson(/3) hypergraph on V N . Then, 
as N —» oo, the numbers of identifiable vertices and identifiable hyperedges 
satisfy the following limits in probability: 

\V n *\/N^z*, |A|/iV —► (3(z*) — (1 — z*) log(l — z*). 

Example 2.1. The random graph with distinguished vertices described 
in the Introduction corresponds to a Poisson(/3 Ar ) hypergraph A N , where 

1 _ e -P? =pj i _ e -2 tfKN-i) = a/N 

and /3j* = 0 for j > 3. Note that = fi\ and Qzf —»• fa as N —>• oo, where 
f3\ = — log(l — p) and /?2 = ot/2. Theorem 2.1 extends easily to cases where 
f3 depends on N in such a mild way: one just has to check that Lemma 
6.1 remains valid and note that this is the only place that (3 enters the 
calculations. We have /3(f) = —tlog(l — p) + t 2 ct/2 so 

/3'(t) + log(l -t) = -log(l —p) + ta + log(l -t). 

Then z* is the unique f G [0,1) such that 

at + log(l -t) = log(l - p) 

and C is empty, so \V* n \/N —» z* in probability as N —> oo, as stated above. 
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Fig. 2. Changes in behavior near a critical parameter value. 

Example 2.2. To illustrate critical phenomena, let f3(t) = a(0.1 + 0.9f) 7 . 
Let x, y and 2 refer to the re-scaled number of vertices eliminated, the num¬ 
ber of patches and the amount of debris, respectively; here “re-scaled” means 
after division by the number of vertices. Plots of y and z versus x are shown 
in Figure 2, for the choices a = 1185 (solid) and a = 1200 (dashed). In the 
case a = 1185, y hits zero when x ~ 0.02, and so 2 remains stuck at about 
0.02. A very small increase in a, from 1185 to 1200, causes a dramatic change 
in the outcome: after narrowly avoiding extinction (Figure 2), the number 
of patches explodes (Figure 3) as x increases toward 1. 

Consider what the figures tell us about the supercritical case a = 1200: 
during the first 4% of patch selections, there is rarely any other patch cover¬ 
ing the same vertex as the one selected; Figure 3 shows that, during the last 
10% of patch selections, an average of 5792 other patches cover the same 
vertex as the one selected. (Read the labels on the x-axes carefully: Figure 
2 is a close-up of the leftmost 4% of the scale of Figure 3.) 


2.2. Hypergraph collapse—general case. In order to describe an exten¬ 
sion of Theorem 2.1 to the case where ( is nonempty, we introduce the 
random variable 

Z = min{z £ (:W(z/(l — z)) < 0} A z*, 
where (Wt)t >o is a Brownian motion. 

Theorem 2.2. Assume that R (. Then, for V N * and A N * as in The¬ 
orem 2.1, the following limits exist in distribution: 

\V n *\/N -> Z, \A n *\/N -*• P(Z) - (1 - Z) log(l - Z). 

In the case where ( has only a single point Co < z*, then Z is equal to 
Co with probability \ and equal to z* with probability We do not know 
what happens when R £ C- Proofs will be given in Section 6. 
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3. Randomized collapse. We introduce here a particular random rule 
for choosing the sequence of moves by which a hypergraph is collapsed, 
which has the desirable feature that certain key statistics of the evolving 
hypergraph behave as Markov chains. It is by analysis of the asymptotics of 
these Markov chains as N —> oo that we are able to prove our main results. 

3.1. Induced hypergraph. Let A be a Poisson(/3) hypergraph. For S C V 
with | S'! = n, let A s be the hypergraph obtained from Ao by removing all 
vertices in S. Thus, for A C V\S with |A| = j, 

A S (A)= J2 A (B)~P(\j(N,n)), 

BDA,B\S=A 

where the Poisson parameter \j(N,n ) is computed as follows: there are ("j 
ways to choose S n B such that \B\ = i + j, and the Poisson parameter of 
MB) is N0j +i /( i ^.), so 

A AN.n^Np^)/^.). 

Moreover, the random variables A S (A), AC.V\S, are independent. 


3.2. Rule for randomized collapse. Recall that the sequence of vertices 
chosen to collapse a hypergraph is unimportant, provided we keep going 
until there are no more patches. However, we shall use a specific randomized 
rule which turns out to admit a description in terms of a finite-dimensional 
Markov chain. This leads to a randomized process of collapsing hypergraphs 
(A n ) n > 0 . This will prove to be an effective means to compute the numbers 
of identifiable vertices and identifiable hyperedges for Aq. 

The process (A n ) n >o, together with a sequence of sets (S n ) n >o such that 
An = A Sn , is constructed as follows. Let Sq = 0 and Aq = A. Suppose that 


y versus x: a = 1200 


0 0.1 02 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 


z versus x: a»1200 


0 0.1 0.2 03 0.4 0.5 0.6 0.7 0.8 0.9 1 


Fig. 3. Patches and debris in the supercritical regime. 
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S n and A n have been defined. If there are no patches in A n , then £> n +i = S n 
and A n+ i = A n . If there are patches in A n , select one uniformly at random 
and denote by v n+ \ the corresponding vertex; then set 5 n+ i = S n U {u n+ i} 
and A n+ i = A Sn+1 . 

3.3. An embedded Markov chain. Let Y n denote the number of patches 
and Z n the amount of debris in A n . Then Y n = 0 and Z n = |A*| for n > 
|V*|. Also |V*| = inf{n > 0: T n = 0}. Let W n+ \ denote the number of other 
patches at time n sharing the same vertex as the (n + l)st selected patch, 
and let U n +i denote the number of 2-edges at time n containing the [n + 
l)st selected vertex v n +i. Our analysis will rest on the observation that 
(Y n , Z n ) n > o is a Markov chain, where, conditional on Y n = m > 1 and Z n = k, 
we have 

Yn +1 = Y n — 1 — W n +i + U n+1 , Z n .)-i = Z n + 1 + W n +1 

and where W n+ \ ~ B(m— 1, 1/(N — n)) and U n+ 1 ~ P((N — n— 1)A2 (N,n)) 
with W n - |_i and U n+ \ independent. 

To see this, introduce the filtration 

T n = a(S ri Y r ,Z, r :r = 0,1,...,n). 


Lemma 3.1. Let 


p(A\S,k,m ) = P 


A s = A| ^ A 5 ({u}) = m, A s (0) = k 


Then 


F[A n = \\F n \=p(A\S n ,Y n ,Z n ). 

Equivalently, for all B G T n so that B C { S n = S,Y n = m, Z n = k}, 
P[A n = A, B] = p{X\S, m, k)F[B}. 

The claimed Markov structure for (Y n ,Z n ) n >q follows easily. 


Proof of Lemma 3.1. The identity is obvious for n = 0. Suppose it 
holds for n. Let B C {S n = S,Y n = m, Z n = k} . Take x G V\S, m' > 1 and 
k! > k. Set S' = S U {x} and B' = {^n+i = S', Y n+ \ = m' , Z n+ \ = k'} n B. It 
will suffice to show, for all hypergraphs X r having m' patches and amount 
of debris k! , that 

P[A n+1 = A / ,5 , ]ocp(A'|5 , ,m , ,A- / ) ! 

where oc denotes equality up to a constant independent of A'. But 
P[A n+1 = A', B'] = Y, k —^nA n = A, B] oc Yp( X \ s , m, k ), 

A m A 
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where the sum is over all hypergraphs A which collapse to A ; on removing 
the vertex x. Let Y S ,Z S denote the number of patches, amount of debris 
in A s , respectively. Since Y s and Z s are conditionally independent of A s 
given Y s ' and Z s \ 

J2p( AIS, m, k ) = P(A S ' = A'|F 5 = m ,Z s = k) oc p(\'\S', rri, k 1 ), 

A 

as desired. □ 

4. Exponential martingales for jump processes. We recall here some 
standard notions for pure jump Markov processes in M. d and their associ¬ 
ated martingales. These will be used to study the fluid limit of a sequence 
of such jump processes in Section 5. 

4.1. Laplace transforms. Let (Xt)t>o be a pure jump Markov process 
taking values in a subset I of WL d , with Levy kernel K. Consider the Laplace 
transform 

m(x,9)= [ e {d ’ y) K(x,dy), 0 e (R rf )*, 

J R d 

and assume that, for some rjo > 0, 

(3) sup sup m(x, 6) < C < oo. 

x£l | 6 »|< 77 o 

The distribution of the time T and displacement A Xt of the first jump of 
(X t ) t > o is given by 

P(T G dt, A Xt G dy\T > t, Xq = x) = K{x, dy) dt. 

Introduce random measures p and v on (0, oo) x M d , given by 

d= £ dAXt)i 

AAt^O 

v(dt,dy) = K(X t _,dy) dt, 

where £{t,y) denotes the unit mass at (t, y)\ v is thus the compensator of the 
random measure p, in the sense of ([10], page 422). 

4.2. Martingales associated with jump processes. The fact that v is a 
compensator implies that, for any previsible process a:Q x (0,oo) x M d —> R 
satisfying 

E / |a(s, y)\v(ds, dy) < oo, 

J K d 
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the following process is a martingale: 

rt 


[ [ a(s,y){/i-v)(ds,dy). 

JO JR d 

In particular, (3) allows us to take a(s,y) = y, which gives the martingale 

M t = / / y(y — v)(ds, dy). 

Jo Js. d 

Fix rj E (0, 770 )- Then there exists A < 00 such that 
(4) \m"(x,6)\<A, x£l,\0\ < 77 , 

where “ / ” denotes differentiation in 9. Dehne for 9 E 

°) = [ {e- {9 ' v) ~ 1 - ( 0 , 2 /)}^(ah dy). 

Ju d 

Then 4> > 0 and, for |#| < 77 , by the second-order mean value theorem, 

4>(x,9)= [ m"(x,rd)(6,6)(l — r)dr 
Jo 


so 


4>{x,9) <\A\9\ 2 , x E /, |0| < 77 . 

Let {9 t )t>o be a previsible process in with |0 t | <77 for all f. Set 

(5) = Zf = exp{/V s , dM s ) - J* <f>(X a ,6 s ) cfa}. 

Then (Z t )t >0 is locally bounded, and by the Doleans formula ([10], page 
440), 


Z t = 1 + 


[ [ Z s ^(e ( ' 6s ’ y '> - l)(/x - u)(ds,dy). 
Jo JR d 


Hence (Zt)t >0 is a nonnegative local martingale, so E (Zt) < 1 for all t. Hence 


E 


I I \Z s ^(e ( ' 9s ’ y ' > - l)\v(ds,dy) 

Jo J R d 

< E f Z s (m(X s ,9 s ) + m(X s , 0)) ds < 2 Ct 
Jo 


so (Zf)t>o is a martingale. 

Proposition 4.1. For all 5 e (Oj^f-y/d], 

p(sup|M s | > 5 ) < (2 d)e~ 52/( - 2Adt) . 


S<t 
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Proof. Fix 9 G with \6\ = 1 and consider the stopping time 

T = mf{t>0:(9,M t )>5}. 

For £ <tj, taking Qt = 0 for all t above, we know that o is a martingale. 

On the set {T < t} we have Zq? > (Ze-Ate 2 /2 . gy optional stopping, 

nz e T \ t )=nzf) = i. 

Hence, 

P^sup (6,M a ) >5^j= P (T <t)< e~ 5e+At£2 l 2 . 

When 5 < Atrf we can take e = d/At to obtain 

P^sup(0,M s ) > ^ < e~ s2/2At . 

Finally, if sup s<t \M S \ > 6 , then sup s<t (0, M s ) > S/Vd for one of 8 = ±ei,.. ., Fed- 
□ 


5. Fluid limit for stopped processes. In this section we develop some 
general criteria for the convergence of a sequence of Markov chains in M. d 
to the solution of a differential equation, paying particular attention to the 
case where the chain may stop abruptly on leaving a given open set. 

5.1. Fluid limits. It is possible to give criteria for the convergence of 
Markov processes in terms of the limiting behavior of their infinitesimal char¬ 
acteristics. This is a powerful technique which has been intensively studied 
by probabilists. The book of Ethier and Kurtz [8] is a key reference. Further 
results are given in Chapter 17 of [10] and in Theorems IX.4.21 and IX.4.26 
of [9]. A particular case with many applications is where the limiting process 
is deterministic and is given by a differential equation, sometimes called a 
fluid limit. The relevant probabilistic literature, though well developed, may 
not be readily accessible to nonspecialists seeking to apply the results in 
other fields. One field where fluid limits of Markov processes are beginning 
to find interesting applications is random combinatorics. Wormald [16] and 
co-workers have put forward a set of criteria which is specially adapted to 
this application. The material in this section may be considered as an alter¬ 
native framework, somewhat more rigid but, we hope, easy to use, developed 
with the same applications in mind. 

Let (X/ v )t>o be a sequence of pure jump Markov processes in M d . It may 
be that takes values in some discrete subset I N of M d and that its 

Levy kernel K N (x,dy) is given naturally only for x G I N . So let us suppose 
that I N is measurable, that o takes values in I N and that the Levy 
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kernel K N (x,dy ) is given for x £ I N . Let S be an open set in W l and set 
S N = I N nS. We shall study, under certain hypotheses, the limiting behavior 
of (X?) t>o as N —y oo, on compact time intervals, up to the first time the 
process leaves S. In applications, the set S will be chosen as the intersection 
of two open sets H and U. Our sequence of processes may all stop abruptly 
on leaving some open set H, so that K N (x,dy ) = 0 for x <£. H. If this sort 
of behavior does not occur, we simply take H = R d . We choose U so that 
the conjectured fluid limit path does not leave U in the relevant compact 
time interval. Subject to this restriction we are free to take U as small as 
we like to facilitate the checking of convergence and regularity conditions, 
which are required only on S. 

The scope of our study is motivated by the particular model which oc¬ 
cupies the remainder of this paper: so we are willing to impose a relatively 
strong, large-deviations-type hypothesis on the Levy kernels K N , see (6) 
below, and we are interested to find that strong conclusions may be drawn 
using rather direct arguments. On the other hand, in certain cases of our 
model, the fluid limit path grazes the boundary of the set S; this calls for 
a refinement of the usual fluid limit results to determine the limiting distri¬ 
bution of the exit time. 


5.2. Assumptions. Consider the Laplace transform 


m N (x,0)= [ e^K N {x,dy\ x £ S N ,9 £ (R d )*. 

J R d 


We assume that, for some t/q > 0, 


( 6 ) 


sup sup sup 

N x£S n \e\<rj 0 


m N (x, NO ) 
N 


< oo. 


Set b N (x) =m N/ (x, 0), where denotes the derivative in 6. We assume 
that, for some Lipschitz vector field 6 on S', 

(7) sup \b N (x) — b(x)\ 0. 

xeS N 


We write b for some Lipschitz vector field on M. d extending b. [Such a b is 
given, e.g., by b(x) = sup {b(y) — K\x — y\: y £ S} where K is the Lipschitz 
constant for b.} Fix a point xo in the closure S of S and denote by (xt)t >o 
the unique solution to x t = b(x t ) starting from xq. We assume finally that, 
for all 5 > 0, 

(8) limsup A r_1 logP(|X(^ — xq\ > <5) < 0. 

TV—xx) 

While these are not the weakest conditions for the fluid limit, they are 
readily verified in many examples of interest. In particular, we will be able 
to verify them for the Markov chains associated with hypergraph collapse 
in Section 3. 
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5.3. Exponential convergence to the fluid limit. Fix to > 0 and set 
T N = inf{t>0:X t N (£S} A t 0 . 

Proposition 5.1. Under assumptions (6)-(8), we have, for all S > 0, 

(9) limsup JV -1 logpf sup \X^ — x t | > 5\ <0. 

N —>oo \t<T N / 

Proof. The following argument is widely known but we have not found 
a convenient reference. Set b N (x ) = m N '(x, 0) and define (Af/ v ) t >o by 

X t N = X^ + M t N + f b N {X*) ds. 

Jo 

Note that corresponds to the martingale we identified in Proposition 
4.1. Fix r) E ( 0 , 770 ). Assumption (6) implies that there exists C < 00 such 
that, for all N, 

\m N "(x,0)\<C/N, x€S n ,\0\<Ntj. 

Compare this estimate with (4). By applying Proposition 4.1 to the stopped 
process {X^ TN )t> 0 , we find constants £0 > 0 and Co < 00 , depending only 
on C,rj,d and to such that, for all N and all £ E (0, £ 0 ], 

(10) f( sup \M t N \ > e) < C 0 e~ N£2/c °. 

\t<T N / 

Given 5 > 0, set £ = vain{e~ Kt °5/3, £ 0 }, where K is the Lipschitz constant 
of b. Let 

= { \Xq — xo| < £ and sup \M^\ < eI. 

I t<T N J 

Then (8) and (10) together imply that 

limsupIV -1 logP(fl\fl Ar ) < 0. 

TV—kx) 

On the other hand, by (7), there exists No such that \b N (x) — 6(x)| <e/to 
for all x E S N and all N > Nq. We note that 

X* f -x t = (Xg r -x 0 )+M t N + [\b N (X?)-b(X?))ds+ f\ b(X?)-b(x 8 ))ds 

Jo Jo 

so, for N > No, on Q N , for t < T N , 

\X? -x t \<3e + K /* |Xf - x B \ds, 

Jo 

which implies, by Gronwall’s lemma, that sup t<T N \X^ — xf\ < 5. □ 
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5.4. Limiting distribution of the exit time. The remainder of this section 
is concerned with the question, left open by Proposition 5.1, of determining 
the limiting distribution of T N . Set 

r = inf{t > 0: xt ^ S} A to, 

T = {t€ [0, r): x t £ 5}. 

It is straightforward to deduce from (9) that, for all 6 > 0, 

(11) limsupIV -1 logpf inf \T N — t\ > <5^) < 0. 

N—too \tSTU{r} ) 

In particular, if T is empty, then T N —> r in probability and, for all 8 > 0, 

limsupIV -1 logP( sup \Xf — xt /\ T | > 8 ) < 0. 

N —>oo \t<to J 

The reader who wishes only to know the proof of Theorem 2.1 may skip to 
Section 6 as the remaining results of this section are needed only for the 
more general case considered in Theorem 2.2. 


5.5. Fluctuations. We assume here that 
(12) T is finite. 

In this case the limiting distribution of T N may be obtained from that of 
the fluctuations 7 f w = \fN (X^ tN — x tAT N ). We assume that there exists a 
limit kernel K(x,dy), defined for x G S, such that m(x,6) < oo for all x E S 
and 10| < r] 0 , where 

m(x, 0) = f e {e ’ y) K(x,dy), x£S,0£(R d )*. 

J R d 

For convergence of the fluctuations we assume 


(13) 

(14) 

(15) 


7 ^ 


sup sup 

x£S n |0|<?7O 


m N (x, NO) 


N 


— m(x, 0) 


sup VN\b N (x ) — b(x) | 

xeS N 


7o in distribution, 

0, 

0 , 


(16) a is Lipschitz and b is C 1 on S, 

where b N (x ) = m N '(x, 0) and a(x) = m"(x,0). Of course (14) will force 
b{x) = m'(x, 0). 





16 


R. W. R. DARLING AND J. R. NORRIS 


5.6. Limiting stochastic differential equation. Consider the process (7 t)t<r 
given by the linear stochastic differential equation 


(17) d'ft = cr(x t )dB t + \7b(x t )'ytdt 


and starting from 70 , where B is a Brownian motion and a(x)a(x)* = a(x). 
The distribution of (7 t)t<r does not depend on the choice of a. For conver¬ 
gence of T n we assume, in addition, 


(18) 


dS is C 1 at xt with inward normal nt and 
IP((^)7t) = 0) = 0 for all t G T. 


Theorem 5.1. Under assumptions (8), (12)-(16) and (18) we have 
T n — s- T in distribution, where 

T = min{f € T : (nt, 7 1 ) < 0} A r. 


Proof. Let to = 0 and write the positive elements of T as 77 < • • • < r m . 
Define, for k = 0,1,... ,m, 


7 k 


Ik 


lr k i ^ T N >r k , 

d, otherwise, 

7 r k , if T>r k , 

d, otherwise, 


where d is some cemetery state. We will show by induction, for k = 0,1,..., m, 
that 

(19) (7^, • • •, 7^) ->■ (70, • • •, 7 k) in distribution. 

Given (11), this implies that T N —> T in distribution, as required. 

Note that both ( 7 ^) 0 <k<m and ( ; %)o<k<m may be considered as time- 
dependent Markov processes. Hence, by a conditioning argument, it suffices 
to deal with the case where 70 is nonrandom. By (18), if xq G dS , we can 
assume that dS is C 1 at xq and (noj 7 o) f 0- Moreover, for the inductive 
step, it suffices to consider the case where % is nonrandom, not d, and to 
show that, if 7 ^ —► 7 /t in probability, then 7 £ +1 —> 7^+1 in distribution. We 
lose no generality in considering only the case k = 0 . 

We have assumed that 7 ^ —> 70 in distribution. Note that T = 0 if and 
only if xq € dS and (no, 70 ) < 0. On the other hand, since Xq = xq + Vn 7 ^, 
we have P(T A = 0) —> 1 if and only if xq G dS and (no, 70 ) < 0. Hence 7 ^ —> 
70 in distribution; that is, (19) holds for k = 0. 

In Lemmas 5.4-5 .6 below, we will show that, if xq G S, or xo G dS and 
(no, 7 o) > 0 , then 


P(T Ar > e) —> 1 for some e > 0, 
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and, in the case m > 1, 

—> y T1 in distribution, 
p (( n n,Yn) ^ 0 and tN ^ T i) °> 

P(Ki,7n> < d and ^ V > Tl ) 

It follows that y^ —> yi in distribution, so (19) holds for k = 1. This estab¬ 
lishes the induction and completes the proof. □ 

We remark that the same proof applies when the Levy kernels K N have 
a measurable dependence on the time parameter t, subject to obvious mod¬ 
ifications and to each hypothesis holding uniformly in t < to- 

For the remainder of this section, the assumptions of Theorem 5.1 are in 
force and yo is nonrandom. 

Lemma 5.2. For all e> 0 there exists A < oo such that, for all N, 

P( sup |yf| > A J <e. 

\t<to J 


Proof. Given e > 0, choose A < oo and Nq such that, for A' = e Kt °\/3 
and N > No, 

VN\b N (x) — b(x)\ < X'/to, x £ S N , 

and, with probability exceeding 1 — e, 

17^1 < A', 

VN sup \M t N \ < A'. 

t<T N 

This is possible by (10) and (15). These three inequalities imply 

\^\<2,\' + K f\^\ds, t<T N , 

Jo 

so, by Gronwall’s lemma 

sup| 7 f|<A. 

t<T N L-l 

Lemma 5.3. For all e > 0 there exists A < oo such that, for all 5 > 0, 
there exists Ns < oo such that, for all N > Ns and all t < to, 

p( sup |yf - yf I > Av/j) <e. 

\ s<to,t<s<t+5 J 
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Proof. Consider first the case t = 0. Given e > 0, choose A < oo such 
that, for all 5 > 0, there exists N$ such that, for A' = e Kt °X/S and N > N$, 

VN\b N (x) — b(x)\ < X'/y/to, x G S N , 

and, with probability exceeding 1 — g, 

iTo^l < A '/KVh, 

VN sup \M^\ < A 'Vs. 

t<T N /\S 

This is possible by (10) and (15). These three inequalities imply 

\% N ~ lo I < 3A'V5 + K f | 7 f - To^l ds, t<T N A 5, 

Jo 

so by Gronwall’s lemma, 

sup | 7 f - 7^1 < A\/$. 

t<T N AS 

The case t > 0 follows by the same sort of argument, using Lemma 5.2 to 
get the necessary tightness of 7 ^. □ 

Lemma 5.4. Suppose either xq G S, or xq G dS and (n 0 , 70 ) > 0. Then 
P (T n > e) —> 1 as IV —> 00 for some £ > 0. 

Proof. The case xo G S follows from (11). Suppose then that xq G dS 
and (no, 70 ) > 0. Then, since dS is C 1 at xo, for all e > 0, there exists (5(e) > 0 
such that, for all x G S with \x — xo| < <5(e), and all v G M d , 

(20) |u|<(5(e) and (no,v)>e\v\ ==> x + v€S. 

Since (no, 70 ) > 0, by Lemma 5.3, given e > 0 there exist £1 > 0 and No such 
that, for all N > N 0 and t < T N A £\, 

{ n 0il?) > eibfl. \lt I < V £ d 

with probability exceeding 1 — e. Choose £2 G (0,£ 1 ) so that |xt — xo| < <5(ei) 
and xt G S whenever t < £ 2 . Set N\ = max{lVo, (ei<5(ei)) -2 }, then, for N > 
N\ and t < T N A £2 , 

xt G S, \x t -xo\ <<5(ei), N~ l/2 \^\ < <5(ei), 

( } (n 0 ,^)>£ 1 \^\, 

with probability exceeding 1 —e. By (20), (21) implies Xf = xt + N~ 1 ^ 2 ^ G 
S. Hence F(T N < e 2 ) < e for all N>N X . □ 

For the rest of this section we assume that m > 1. (The next result holds 
with n replaced by r when m = 0, by the same argument, but we do not 
need this.) 
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Lemma 5.5. Suppose either xq G S, or xq G dS and (no, 70 ) > 0. Then 


7 ^ —* 7 n in distribution as N —> 00 . 


Proof. By Lemma 5.3, given 5 > 0, we can find t < T\ such that, for all 


IV, 


ni7f - 7^1 > 8) < 5, P(l7t - \>b)<5. 


Hence it suffices to show 7 ^ —> 7 1 in distribution for all t <T\. 

Define (ipt )t< T in ® (M d )* by 

ipt = Vb(x t )ip t , ipo = id. 

Fix 0 G (R d )* and set 9 t = (V’t )~ 1 9. Then 

d{0 t ,Tt) = (0t,<?(xt)dB t ), t<r , 

so 

(0t,lt) ~n(^(0,~/ o ),J q (9 s ,a(x s )9 s ) ds'j , t<r. 

On the other hand, for 0 as in the proof of Proposition 5.1, 

d(6 t , 7 *) = , dMf) + ’ 0 dt, t<T N , 

where 

= ^N(e t , b N (X») - b{x t ) - Vb(x t )(X t N - x t )). 

By (15), 

sup ^N\b N (X t N )-b(X t N )\-+0. 

t<T N 

By (16), given e > 0, there exists 6 > 0 such that, for all t G [e,ri — e], for 

|a: — xt\ < S, 

| b{x) - b(x t ) - Vb(x t )(x - x t )\ < e\x - x t \. 

Hence \X^ — x t \ < 8 and e < t < 77 — e imply 

VN\b(X t N ) - b(x t ) - Vb(x t )(X t N -x t )\< e\^\. 

Combining this with Lemma 5.2, we deduce that 

f dt —> 0 in probability. 

Jo 

Hence it suffices to show, for all 9 G (R 0 *)* and all t < 77 , 

Vn J (9 s ,dM^) —> N ^0, J (9 s ,a(x s )9 s ) ds^j in distribution. 
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Indeed, it suffices to show, for all 9 G (M rf )* and t < t\, that E {E^’ e ) —> 1 as 
N —> oo, where 

E?’ 9 = exp{ iVNj\e s ,dM») + i J\e s ,a(x s )9 s ) dsj. 

Set rh N (x, 9) = m N (x, i9 ), rh(x, 9) = m(x, i9) and 


cf> N (x, 9) = [ ( e l{e ’ y) - 1 - i(9, y))K N (x, dy ). 

JR'* 


By (14), for all y <y o, we have 

sup sup \Nrh N "(x, N9) —rh"(x,9) \ —> 0. 

|0|<j? 

Note that 


4> n (x,VN9) + ^(9,a(x)9) 


= [ (Nrh N "(x,VNr9) — rh"(x,0))(9,9)(l — r) dr 

Jo 


so, for all p < oo, 

(22) sup sup 1 ^( 2 ;, VN9) + 4(0, a(a;)0)| —► 0. 

xeS N \e\< P 


Write E^ 6 = E? = Z^A^B^, where 

Z t N = exp^iVN j\d s ,dM ?) - (X?,VN9 s ) ds^, 

A? = exp {^(^(Xf, v/iV0 s ) + ±(0 S , a(Xf )0 S )) dsj, 

Bt =exp|^ ±(0 s ,(a(x s )-a(Xf))0 s )dsj. 

Now (Z^ tN ) t<r is a martingale, as in (5), so E (Zj^ TN ) = 1 for all N. 
Fix t < t. By (22), Zf^ T N is bounded, uniformly in N, and A^ atN —> 1 
uniformly as N —> 00 . Moreover, by (16), B^ atN is bounded uniformly in N 
and converges to 1 in probability, using (9). Hence 

E < Z ,AT»<T» AaT»> - 1 

as N —* 00 . By Lemma 5.4 and (11), F(T N > t) —► 1 for all t < T\. It follows 
that E (E^) —* 1 for all t <T\ as required. □ 


Lemma 5.6. 
as N —> 00 , 


Suppose either xq G S, or xq G dS and (no, 70 ) > 0. Then, 

P((n ri , 7 ^) > 0 and T N < n) —> 0, 

P((n n , 7 ^) < 0 and T N > n) —> 0. 
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Proof. By Lemma 5.5, given e > 0, there exists £\ > 0 and Nq such 
that, for all N > N, o, 

I { n n, Yn) I > £l l^n I> £ i < b£l < V £ i> 

with probability exceeding 1 — e. Then by Lemma 5.3, there exists £2 > 0 
and N\ > Nq such that, for all N > Ni, with probability exceeding 1 — s, 
either 

(23) (n Tl , Yf) > £ 2 1 'It I, I 1 It I < l/£2 for all t G [n — e 2 , n] 

or 

( 24 ) fan, 7%) < - £ 2|Ynl> l7nl <;L / e 2- 

Since 35 is C 1 at x n , there exists 5 > 0 such that 

if x E S and v £ M. d with \x — x T] \ < 8, |u| < 5 and 
(n Tl , v) < £ 2 |u| then x + v G 5 

and 

if v G M d with |u| < 5 and (n Tl ,v) < —£ 2 |u| then x Tl + v S. 

Choose £3 E (0, £ 2 ] such that |xt — x T1 \ < <5 and xt E S whenever t E [t\ — 
£ 3 ,T"i]• Set IV 2 = maxjiVi, (£ 2 5) -2 }. Then, for N > N 2 , since X f N = xt + 
N~ 1 ^ 2 7t V on {T n > t}, (23) implies E 5 for all t E [n — £ 2 ,ti] or < 
t\ — £ 2 ) and (24) implies ^ 5 or < n. We know by Lemma 5.4 and 
(11) that P(T n < n — £ 2 ) —»• 0 as N 00 . Hence, with high probability, 
as IV —> 00 , (n T1 , 7 ^) > 0 implies (23) and then > n, and (n ri , 7 ^) < 0 
implies (24) and then T iV < n. □ 

6. Fluid limit of collapsing hypergraphs. We now apply the general the¬ 
ory from the preceding sections to prove our main results Theorems 2.1 and 
2.2. 

6.1. Levy kernel for collapse of random hypergraphs. In Section 3 we 
introduced a Markov process (A n ) n >o of collapsing hypergraphs, starting 
from Ao ~ Poisson(/3) and stopping when n = | V*|, the number of identifiable 
vertices in Ao- The process (Y ni Z n ) n >0 of patches and debris in A n was found 
itself to be Markov. We now view this process as a function of the initial 
number of vertices N and obtain a fluid limit result when N —> 00 . 

It will be convenient to embed our process in continuous time, by removing 
vertices according to a Poisson process (r , t)t >0 of rate N which stops when 
vt = \V*\. Set 


X? = N~ 1 {y t ,Y Vt ,Z Vt ) 
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and note that X N takes values in 

, I N = {x G M 3 : Nx 1 G {0,1,..., N - 1},Nx 2 ,Nx 3 G Z+} 

1 j U{(l,0,j 3 ):JVi 3 eZ + }. 

The Levy kernel K N (x,dy ) for (X^) t>o is naturally defined for x € I N . If 
x 2 = 0, then K N (x,dy) = 0. If x 2 > 0, then N~ 3 K n (x, •) is a probability 
measure; by Lemma 3.1, it is the law of the random variable J N /N, where 

J N = (l, _i -W N + U N , 1 + W N ), 

W N ~ R(Xx 2 - 1, 1/(X - Nx 1 )), U N ~ P((X - Xx 1 - l)A 2 (iV,Xx 1 )) 
with and independent. 

Recall that R denotes the radius of convergence of the power series /3(t), 
given by (1). We assume, until further notice, that R > 0 and fix f 0 £ (0, R A 
1) and p G (£o> P A 1). 

Lemma 6.1. There is a constant C < oo such that 

|XA 2 (X,n) - /3"(n/N)\ < C(logX) 2 /X 
for all N G N and n G {0,1,..., [Xp]}. 


Proof. Recall that 


A 2 (X,n) = X]T(z + l)(i + 2)/3 i+2 

i=0 


n(n — 1) • • • (n — i + 1) 
X(X- l)--.(X-i- 1)' 


Set M = AlogX where A = (log (R/p)) 1 < oo. Then, for n < [Np ], 

IXA 2 (X, n) — P" (n/X) | 

M An oo 

< ^ (z + l)(z + 2)(3 i+2 5i(N,n) + 2 ^ (z + l)(z + 2)/3j +2 p*(X,n), 

2=1 2 = M +1 


where 

5i(N,n) = 

and 


X 2 


n 


/ n — 1 


(X-i)(X-i-l)XVX-iy "\X-i + l 


n — z + 1 


Pi(N,n) = 


X 2 


/ n 


(X-z)(X-z- 1) VX 
Note that, for j = 0,..., z — 1 and i < M An 

n — j n 


<Cp\ 


N-j X 


< AlogX/X 
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so, making use of the inequality | II a j ~~ II bj | < I a j ~ bj | for 0 < aj , bj < 1 , 

we obtain 

di(N,n) < C(logN) 2 p i /N. 

Hence 

|iVA 2 (7V,n) - (5"{n/N)\ < C{\ogN) 2 0"(p\/N + C(p/R) M 
and (p/R) M = 1/N. □ 

6.2. Fluid limit. The main result of this section is to obtain the limiting 
behavior of {X^ )t> o as N — y oo, which we deduce from Proposition 5.1 
and Theorem 5.1. We present first the calculations by which the limit was 
discovered. 

Note that, as IV —* oo, for x 1 < R A 1, we have W N —> W and U N —> U in 
distribution, where 

W ~ P{x 2 /( 1 - x 1 )), U ~ P((l - x l )(5"(x 1 )). 

Set J = (1, — 1 — W + U, 1 + W). Note also that Xq —> xq = (0,/?i,/?o) and 
\/lV(X 0 A — xo) —■» Yo in distribution, where Yq = 0, Yo ~ 1V(0, /?i), Yo ~ ^(0, /?o)> 
with Yo an d Yo independent. Thus, subject to certain technical conditions, to 
be checked later, at least up to the first time that X^' 1 > R A 1 or X^'~ = 0, 
the limit path is given by x t = b(xt), starting from xq, where 

b(x) = E(J) = ^1, -1 - — + (1 - x 1 )/?"^ 1 ), • 

Fix p’ e (0, oo) and set 

(26) S = {(x\x 2 ,x 3 ): |x x | < p,x 2 € (0,//),x 3 G R}. 

Then b is Lipschitz on S and, for p' sufficiently large, the maximal solution 
on [0, to] to ±t = b(xt) in S starting from x 0 is given by (xt)t< T , where 

x t = (t, (1 - + log(l - t)),P(t) - (1 -t) log(l - t)) 

and 

T = Z* A to- 


6.3. Limiting fluctuations. Set a(x) = E(J ® J). A convenient choice of 
a such that a a* = a is a = (hi, V 2 , V 3 ), where 

O' 


V 1 (x) = 


x* 


1 — X 1 


1 


V 2 {x) = ^(l-x 1 )/^ 1 ) ( 1 ) , 


V 3 (x) = b(x). 
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Note that a is Lipschitz and b is C 1 on S. The limiting fluctuations are given 
by 

d^t = ^2,Vi(x t )dBl + yb{ x t)'ytdt 1 t<r , 

i 

starting from 70 , where B is a Brownian motion in R 3 independent of 70 . 
Note that 


T = (t G [0, r): x t £ S} = ( n [0, t 0 ). 

In cases where T is nonempty, the limiting behavior of (X^)t<t 0 depends 
on the signs of the component of the fluctuations normal to the boundary, 
that is, on ( 7 / : t E T). 

Note that 9t = b(xt)Bf satisfies 

de t = v i {x t )dBf + vb{x t )e t dt. 

This is the part of the fluctuations which reflects our Poissonization of the 
time-scale. Since b 2 (xt ) = 0 for all t G T, it does not affect ( 7 / :t E T). So 
consider 7 / = 7 1 ~ 6t- Then 

dnt = Vi (xt) dBj + V 2 (xt) dB 2 + Vb{xtH dt. 

Note that V ^ 1 (a;) = V] (x) = 0 and V 6 1 (x) = 0, so (jt) 1 = ( 70) 1 = 0 for all t. 
Also db 2 /dx 2 = —1/(1 —x 1 ) and db 2 /dx 3 = 0 . Also x\ = t and x 2 /(l — x\) = 
/3'{t) T log(l — t). The sign of ( 7/) 2 is the same as that of at = ( 7 /) 2 /(l — t). 
We have 


da t = cVy / 2 /(1 - t) + 7 t * 2 /(1 - t) 2 dt 
= (V 2 {x t ) dBl + V 2 (x t ) dB 2 )/( 1 - t) 
so we can write at = W(erf), where IT is a Brownian motion and 

f* P'(s) + log(l - s) + (1 - s)P"(s) 


a t ~ P 1 + [ 
Jo 


1 — s 


■ ds 


1 3'(t ) +log(l -t) + t 


1 -t 


We have shown that (sgn( 7 t 2 ): t 6 T) has the same distribution as (sgn(lV t /( 1 _ t )) :t G 
T). In particular, R( 7 2 = 0) = 0 for all t G T. 

Recall that Z is defined by 


Z = min {z G (\W{z/l — z) < 0} A z*. 


Set 


T n = inf{t > 0: X *' 2 = 0} 
and put Z(to) = Z A to, T N (to) = T N A to- 
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Theorem 6.1. For all 5 > 0 we have 

limsup IV” 1 logP( sup \X t N - x t \ > 6) < 0. 

Y—>oo \t<T N (t 0 ) J 

Moreover, T N (to ) — > Z(to) in distribution as N —► oo. 


Proof. We defined I N , the state-space of (X^) t >o, in (25), and S in 
(26). Set S N = I N n 5. For x £ S N we have 

m N (x,0)= I e {9 ' y) K N (x,dy) = N1K(e {9 ' jN ) /N ) 

J R 3 


SO 


m N (x,d) 
N 


= exp 


0 1 


e 2 + e 3 + b 



i 

N-Nx 1 


e 3 -e 2 \ 

N J 


+ P((N- Nx l - 1)X 2 (N, Nx 1 ), |, 

where, for 9 £ M, we write B(N,p,6) = iVlog(l — p + pe e ) and P(X,0) = 
X(e e — 1). So, by Lemma 6.1, 


sup sup 

x£S n \8\<r)o 


m N (x,N9 ) 

N 


— 777 (x, 0) 


0 


as IV —» oo, for all po> 0, where 
m(x, 9) = E(e^’ J ^) 

= exp -9 2 + 9 3 + p(^ t ^- Ii 9 3 - 9 2 ^j + P(( 1 - x 1 )?" {x l ),0 2 ) j. 
Set 


b N {x) = I yK N (x,dy)=~E(J N ), 

J R3 

then, by Lemma 7.1, 

sup VN\b N (x ) — b(x )| —> 0. 

xeS N 

Recall that 

X?’ 1 = 0, NXq’ 2 ~ P(NP i), NX^’ 3 ~ P(JV/3o), 

and xo = (0, /?i, /3o) • By standard exponential estimates, for all <5 > 0 

limsup JV _1 logP(|JV^ — xo| > 5) < 0. 

N —>oo 


We have now checked the validity of (7), (8), (12)-(16) and (18) in this 
context, so Proposition 5.1 and Theorem 5.1 apply to give the desired con¬ 
clusions. □ 
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Remark 6.1. If z* < 1, then z* < R/\l, so by choosing to £ ( z*,RA 1) we 
get Z(to) = Z and, as N —» oo, with high probability T N (to) = T N . Hence, 
when z* < 1, Theorem 6.1 holds with Z and T N replacing Z(ta) and T N (to). 
In particular, Theorem 2.1 follows. 


6.4. Proof of Theorem 2.2. 

y n 

Let z£(U {z*}. If 2 < 1, then z<HAl so, by choosing to£(z,R A 1) in 
Theorem 5.1, we obtain 

(27) P(\Xp N -x z \<6)->F{Z = z) 


Recall that 


| V N *\ (A^* 
N ,0, ^V“ 


for all sufficiently small 5 > 0. 

It remains to deal with the case z = z* = 1. Note that | | < N and 

| A N*| < | A iV|_ Nqw | A JV| _ p(iv/5(i)) 

so \A n \/N^P(1) in probability as 
N —> oo. It therefore suffices to show, for all <5 > 0 and a < /3(1) — 5, 

/\V N *\ lA^I 

lim inf P -- > 1 — 6 and -- > a 

IV—> oo \ N N 

When combined with (27) this completes the proof as we have exhausted 
the possible values of Z. 

We consider first the case R > 1. We can find to ^ (1 — <5/2,1) such that 
/3(to) > a + 5/2. Note that \X^ — xt 0 \ <5/ 2 implies 

\V n *\/N > Xg’ 1 >t 0 -6/2>l-a, 

lA^I/AT > X^ > /J(t 0 ) - (1 - to) log(l - to) - s/2 > a. 

By Theorem 6.1 


^ >P(Z= 1). 


lim inf P ( sup | X t N 

N^oo \t<t 0 


xt\ < S/2 ) > P(Z > to) > P(Z = 1) 


so we are done. 

Consider next the case R = 0. Fix Me N and set B 3 = j3j if j < M and 
/ 3j = 0 otherwise. Then, with obvious notation, we can choose M so that 
2 q > 1 — S/2, ( = 0 and /3( zq ) > a + 5/2. Hence 


/\y N *\ 

P ( —- >1 — 4 and 



1. 


We can couple A and A so that A(A) = A(A)l| A<M . Then V N * C V N * and 
A iv* < A n*^ go ^jg j g enoU gh. 
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There remains the case R E (0,1). In this case £ is finite. We have assumed 
that R £ C- So we can find p £ (sup£, R) and Me N such that, with obvious 
notation, 


z 0 > l-S/2, ( = (, f3(z 0 ) > a +5/2, 

where P(t),t E [0,1), is defined by 

r/m 


m=Po, /3'(0 )=/?!, /3"(i) = ^ 


M 


2 > 

3 =2 


t < P, 
t> p. 


Consider the collapsing hypergraph (Af) n >o which evolves as (Af)n>o up 
to n = v(p), at which time all hyperedges having at least two vertices and 
originally having more than M vertices are removed, so that Af^ < Af ^. 

After v(p),{ Af ) n > 0 evolves by selection of patches as before. Denote by 
V N * the set of identifiable vertices in and by A N * the corresponding 

identifiable hypergraph. Then 

V N * c V N * an d A n *<A n *. 


A modification of Theorem 5.1 shows that 

P(|^f - x- Z0 \ < 5/2) - P(Z = So) = P (Z = 1) 

with Xf = (| V'^I/JV.O, \A n *\/N) and with 

x t = (t, (1 - t)0'(t) + log(l - t)),j3(t) - (1 -t) log(l - 1)). 

All that changes in the proof is that, for t > p the Levy kernel is modified 
by replacing A 2 by A 2 given by 

\ 2 (N,n) = iATfi + 1)0 + 2)ft +2 n(n-l)-(«-i + l) 

; N{N ~ 1) • • • (X - * + 1) 

The argument of Lemma 6.1 shows that for all p 1 < 1 there is a constant 
C < 00 such that 


\NX 2 (N,n) -pf'(n/N)\ < C/N 

for all A eN and n = {0,1,..., [JVp / ]}. Everything else is the same. Now 
| Xf — x ZQ | <5/2 implies 

\V n *\/N >\V n *\/N = xf ’ 1 >z 0 -5/2>l-5, 

\A n *\/N > \A n *\/N = xf 3 > p(zo) - (1 - So) log(l - So) - s/2 > a , 


so 


lim inf P 

TV—>-oo 


yN* I 

> 1 — 5 and 
N - 



>P(Z= 1) 


as required. 
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